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JOINT  PROBABILITY  DENSITY  FUNCTION  OF  SELECTED 
ORDER  STATISTICS  AND  THE  SUM  OF  THE  REMAINDER 
AS  APPLIED  TO  ARBITRARY  INDEPENDENT  RANDOM  VARIABLES 


INTRODUCTION 


Detection  and  location  of  weak  signals  in  random  noise  is  frequently  accomplished  by  the 
ordering  of  the  random  variables  (RVs)  in  a  measured  dataset,  followed  by  an  investigation  of 
the  locations  and  statistics  of  several  of  the  largest  RVs  under  consideration.  Also  of  interest  are 
the  remaining  smaller  RV s  in  the  dataset,  which  can  be  used  to  estimate  the  background  noise 
level  and  to  form  a  basis  for  normalization,  thereby  realizing  a  constant  false  alarm  processor. 

In  this  study,  the  original  dataset  {x„ }  is  composed  of  N  independent  RV s  with  arbitrary 
probability  density  functions  (PDFs)  {p„(x)}.  This  dataset  is  ordered  into  another  dataset  of 

dependent  RVs,  each  with  a  different  PDF.  From  this  latter  set,  the  M- 1  largest  RVs  are 
selected.  The  sum  of  the  remaining  N+l-M  RVs  is  then  computed,  giving  a  total  of  M 
dependent  RVs.  The  joint  M-th  order  PDF  of  these  M  dependent  RVs  is  one  of  the  quantities  of 
interest. 

For  convenience,  the  following  notation  is  used.  The  largest  RV  in  set  {x„ }  is  denoted  by 
Zj,the  second-largest  by  z2 , . . . ,  the  M-l  largest  by  zw_, ,  and  the  sum  of  the  remaining  RV  s  by 
zM .  Thus,  the  first  M-l  RVs  satisfy  the  restrictions  that  z,  >z2  >•••>  zM_, ,  while  the  last  RV 
must  obviously  satisfy  the  restriction  that  zM  <(N  +  \-M)  zM  l. 

To  solve  for  the  statistics  of  RVs{zm},  general  M,  a  series  of  simpler  problems  will  be 

solved  starting  with  M  =  2,  that  is,  the  largest  RV  and  the  sum  of  the  remainder.  By  the  time  M 
increases  to  5,  the  general  pattern  will  be  obvious  and  may  be  extended  to  a  larger  M  of  interest. 
The  end  result  is  a  single  one-dimensional  contour  integral  for  the  joint  PDF  of{zffl  },  which  can 

easily  and  accurately  be  numerically  evaluated  by  moving  the  contour  of  integration  to 
approximately  pass  through  the  real  saddlepoint  of  the  integrand. 

For  later  use,  it  is  convenient  to  define  the  auxiliary  function 


c„  ( z ,  X)  =  exp(A  x)  pn  (x)  for  n  =  1 :  N, 

—oo 


which  is  a  mixture  of  a  cumulative  distribution  function  (CDF)  and  a  moment-generating 
function  (MGF).  That  is,  cn  (z,0)  is  the  CDF  corresponding  to  PDF  pn  (x),  while  cn  (+oo,  X)  is  the 
corresponding  MGF.  Variable  z  is  real,  while  X  can  be  complex.  Several  useful  examples  of  the 
cn  (z,  X)  function  are  listed  in  appendix  A  of  reference  1 . 
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DERIVATIONS  OF  JOINT  PROBABILITY  DENSITY  FUNCTIONS 


The  method  for  deriving  joint  PDFs  is  based  very  heavily  on  reference  1,  pages  5  through 
15.  The  notation  introduced  there  will  also  be  used  here.  The  reader  is  advised  to  review  that 
material  before  proceeding.  The  major  difference  here  is  that  each  RV  now  has  different 
statistics.  Specifically,  the  PDF  of  RV  x„  is  pn  (x)  for  n  =  1  :N,  instead  of  a  common  PDF  p{x) 

used  earlier.  The  CDF  of  RV  xn  is  cn(x),  while  its  exceedance  distribution  function  (EDF)  is 
en(x).  The  RVs{x„},  n  =  \:N,  are  independent  of  each  other. 


LARGEST  RANDOM  VARIABLE 

The  probability  that  RV xm  is  the  largest  RV  and  that  it  is  in  the  interval  (z,,z,  +  dz, )  is 


IV 

dz\  PMU'M- 


Thus,  the  probability  that  RV  x,„  is  the  largest  RV  is 


Q(m)  =  \dz ,  pm  (z, )  Yl  cn  Ui )  =  f dz  for  m  =  \:N, 

J  »=i  J  cjz) 


where  the  product  of  auxiliary  functions  is 


P{z,X)  =  Y\cn{z,X). 


The  sum  of  all  the  {Q(m)}  probabilities  is  unity: 


\dz  j^Pm{z)Y[c,Xz)=  \dz^Ylc»W  =  f[cMr-~  =1- 


W=I 


As  an  example,  if  all  the  RVs  have  exponential  PDFs,  namely, 

P„  (*)  =  exp(-a„  x)  U (x),  cn  (x)  =  [1  -  exp(-a„  x)]  U(x)  for  n  =  1 

then  it  follows,  from  equation  (2),  that 


-  A/ 

Q{m)  =  \dz  am  exp (~am  z)  {1  -  exp(-a„  z)}  for  m  =  1 :  N. 
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Although  these  integrals  can  be  evaluated  in  closed  form,  they  are  probably  most  efficiently 
accomplished  by  numerical  integration,  especially  for  large  N,  once  {an  }  are  specified 
numerically.  For  other  than  exponential  PDFs,  the  integrals  in  equation  (2)  will  have  to  be  done 
numerically. 

Given  that  RVz,  =  xm  is  the  largest  RV,  the  conditional  PDF  ofRVx„,  n  *  m,  at  argument  x, 
when  z ,  has  value  z, ,  is 


PMu^-x).  (7) 

The  corresponding  conditional  MGF  is 


\dx  exp(A  x)  = 

i  cn(z  i) 


n^m. 


(8) 


Therefore,  the  conditional  PDF  of  the  sumz2  of  the  remaining  RVs  (other  thanxm ),  at  argument 
z2 ,  given  thatz,  =  z, ,  is  available  from  a  Bromwich  contour  integral  as 


1 

i2n 


jdJl  exp (-A  z2 )  • 

J  ■  Cn(Z  l) 


n= 1 
n*m 


(9) 


Finally,  the  product  of  equations  (1)  and  (9)  and  dz2  is 

1  N 

dzx  dz2  pm (z, )  jdA  exp(-A z2)  it,  (z,,A),  (10) 

lln  c  n=l 

n±m 

which  is  the  probability  thatz,  (=  x,„ )  is  the  largest  RV,  that  it  lies  in  the  interval (z,  ,z,  +  dzx), 
and  that  the  sum z2 lies  in  the  interval (z2,z2  +dz2).  That  is, 

q2(m,zx,z2)  =  pm(zx)~  f^exp(-Az2)fJcn(z,,A)  forw  =  l:V  (11) 

l27t  c  n=\ 

n±m 

is  the  combined  probability  (thatxm  is  the  largest  RV)  and  joint  PDF  ofz,  (=  xOT  )  andz2  (which 
is  the  sum  of  the  remaining  RVs). 

Let  A  =  iy  in  equation  (11)  and  integrate  onz2to  get 

N  N 

\dz2  q2(m,zx,z2)  =  pm(zx)  \dy  S(y)\\cn(zx,iy)  =  pm(zx)  IRw-  (>2> 

n±m  n*m 
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Then,  by  use  of  equation  (2),  an  additional  integral  yields 

Jjc/z,  dz2  q2 ( m , z,,z2)  =  Q{m)  for  m  =  \\N.  ( 

That  is,  function q2{m,zx,z2)  in  equation  (1 1)  is  not  a  true  PDF  because  its  area  is  less  than  1. 
However,  the  conditional  PDF  ofz,,z2,  given  thatx;nis  the  largest  RV,  is 

1  lr  N 

q2(zl,z2\m)  =  ——pm(zl)—  \dAexp{-Az2)Y[cn{zvA),  ( 

n*m 

which  is  a  true  PDF  for  all  m  =  1  :N. 

Alternatively,  the  sum  of  equation  (11)  over  all  m, 

*  1  r  N  N 

qM,z2)^q2(m,zx,z2)  =  —  jdA  exp(-/l z2 )  £ pm (z, ) c„ (r, , X),  ( 


is  a  true  PDF,  namely,  the  unconditional  joint  PDF  of  the  largest  RVz,  and  the  sum  of  the 
remaining  RVsz2.  Using  equation  (3),  this  quantity  can  be  expressed  as 

?2 (*p *2 )  =  -^“  exp(-A z2 )  P(z,, A)  £ ■  P"(Z'L  ( 1 6 

i2n  l  ^c„(z,,A) 

which  avoids  the  nested  two-dimensional  operation  in  equation  (15)  in  favor  of  a  one¬ 
dimensional  product  and  a  one-dimensional  sum,  both  of  which  depend  on  the  variable  of 
integration  A.  This  joint  PDF  is  zero  for  z2  >(A-l)z,  becauseRVz2  <(A-l)z,  is  always 
true.  An  alternative  argument  is  given  in  equation  (16)  of  reference  1;  it  uses  the  fact 
that  c„  (z,  A)  is  analytic  in  A  for  Re(A)  >  An ,  a  problem-dependent  critical  value. 

As  an  alternative  check,  if  all  the  RVs{x„ }  are  identically  distributed,  equation  (15)  reduces 
to  equation  (15)  of  reference  1 . 


TWO  LARGEST  RANDOM  VARIABLES 

The  probability  that  RV  xm  is  the  largest  RV,  that  xk  is  the  second-largest  RV,  that  xm  is  in 
interval  (z,,z,  +£fe,),and  thatx*  is  in  interval  (z2,z2  +dz2),  z,  >  z2,is 


dzi  dzi  PJz>)Pk(zi)  flc«  (z2)f/(z,  -z2),  mitk. 
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Then,  the  probability  thatxm  is  the  largest  RV  and  thatx*  is  the  second-largest  RV  is  best 
obtained  by  integrating  equation  (17)  overz,  first: 


N  00 

Q(m,k)  =  jdz2  pk(z2)  n  c„(z2 )  \dzx  pm(zx) 

n-\  z2 

n*m,k 

N 

=  \dz2  em(z2)pk(z2)  ]lcn(z2) 


(18) 


n= 1 
n*m,k 


=  J<7zP(z,0) 


(*)/>*(*) 
cm{z)  ck(z ) 


form^fc,  m,k  =  \ :  N. 


A  single  integral  suffices  to  determine  this  probability.  The  sum  of  all  the  {Q(m,  k)}  probabilities 
for  m*k  is  unity. 


For  the  example  of  exponential  RVs  in  equation  (5),  the  integral  in  equation  (18)  takes  the 


form 


Q(m,  k )  =  \dz  ak  exp[-(am  +ak)z]Yl^~  exp  (~a„  z)}  for  m*k 


(19) 


W= 1 

n*m,k 


Again,  although  possible  analytically,  numerical  integration  is  the  most  practical  method. 

Given  that  RVz,  =  xm  is  the  largest  RV  and  thatz2  =  \k  is  the  second-largest  RV,  the 
conditional  PDF  ofx„,  n  *  m,k,  at  argument*,  when  z,has  value  z,  and  z2  has  value z2, 
withz,  >  z2,  is 


P„(x) 


U(z2  -jc). 


cn(z2) 

The  corresponding  conditional  MGF  is 


(20) 


j* 


P"Wex p(/l*)  =  £^,  n*m,k. 


Cn(zl) 


Cn(zl) 


(21) 


The  conditional  PDF  of  the  sum  z  3  of  the  remaining  RV  s  (other  than  xm,x(t)at  argument  z  3 , 
given thatZj  =Zj  andz2  =z2,is 


J  „=,  c„(z2 ) 

n*m,k 


(22) 
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The  product  of  equations  (17)  and  (22)  and  dz3  is 

1  r  N 

dz i  dz2  dz3  pm  (z, )  pk  (z2 )  —  JdA  exp(-A  z3 )  ]"] c„ (zi ,  ■ X)  U(zl  -  z2 ),  m  *  k,  (23; 

l^n  C  n= ! 

n*m,k 

which  is  the  probability  thatx„,  is  the  largest  RV,  thatx*  is  the  second-largest  RV,  thatz,  =  xm 

lies  in(z,,z,  +  c/zl),thatz2  =  xAlies  in(z2,z2  +dz2), and  that  sum z3  lies  in(z3,z3  +dz3).  That  is, 
for //?  *  k, 

q 3 (w> k,zx,z2,z3)  =  U(zx  -  z2 )  pm (z, )  pk (z2 )  jdA  exp(-A z3 )  c„ (z2 , A)  (24; 


is  the  combined  probability  (thatx,,,  is  the  largest  RV  and  thatx*  is  the  second-largest  RV)  and 
joint  PDF  of  z,  (=  x,„),  z2  (=  xk ),  and  z3  (the  sum  of  the  remaining  RVs). 

Let  A  =  iy  in  equation  (24)  and  integrate  on  z3  to  get 

f  N 

]dz2  q3(m,k,zx,z2,z3)  =  U(zx  -z2)  p„,(z])  pk{z2)  f^[c„(z2)-  G 


Then,  by  reference  to  equations  (17)  and  (18),  the  remaining  two  integrals  yield 

jffdz,  dz2  dz3  q3 (777, *,  z, ,  z2 ,  z3 )  =  Q{m, k),  m±k.  (26) 

That  is,  function  q3(m,k,zx,z2,z3 )  in  equation  (24)  is  not  a  true  PDF  because  its  area  is  less  than 
1.  However,  the  conditional  PDF  ofz(,z2,z3,  given  thatzra  is  the  largest  RV  and  thatz*is  the 
second-largest  RV,  is 

93  (z,  ,  Z2  ,  Z3  I  777,  k)  =  -~'k)  U  (Z.  -  *2  )  Pm  (Zl  )  Pk  (Z2  ) 

1  r  N  (27) 

JdA  exp(-A  z3 )  f]c«(z2^). 


which  is  a  true  PDF  for  all  tt?  *  k,  m,  k  =  1  :N. 
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Alternatively,  the  sum  of  equation  (24)  over  all  w  ^  k. 


ty 3  ?  Z2  ’  “  2^  ?  ^2  ?  ^3  ) 


1  N  N  N 

=  U{z 1  -  z2 )  — -  JdA  exp(~/l  Z3 )  £  pm  (zl )  X  Pk  (z2 )  IT c«  (z2  ’ 


£=1  «=1 
n*m,k 


is  a  true  PDF,  namely,  the  unconditional  PDF  of  the  largest  RV  z,,the  second-largest  RVz2,and 
the  sum  of  the  remaining  RVsz3.  By  using  equation  (3),  and  adding  and  subtracting  the  k=  m 
term  in  the  inner  sum,  this  quantity  can  be  expressed  as 

q3(zx,z2,z3)  =  U(zx-z2)-^—  \dA  ex?(-Az3)  P(z2,A)  (Sx  S2-S3),  (29] 

iln  ^ 

where  one-dimensional  sums 

o  yp  PAZ  |)  c  -V  P"(Z  2)  y  V'  Pn  (Z1 )  Pn  (z2 )  HO 


tfc„(z2,i)  £fc„(z2,A) 


Equation  (29)  is  much  more  advantageous  computationally  than  equation  (28),  which  requires  a 
nested  three-dimensional  sum  and  product.  Joint  PDF  q3(zx,z2,z3)  is  zero  for  z3  >  ( N  -  2)  z2 

because  RV  z3  <  (N  -  2)  z2  is  always  true. 

As  a  check,  if  all  the  RVs{x„}  are  identically  distributed,  the  result  in  equation  (28)  reduces 


N(N- 1)  p(zx)  p(z2 )  ~  \dA  exp (-Az3)  c(z2,A)N~2  U(zx  -z2), 

i/rr  J 


which  is  equation  (35)  of  reference  1.  Also,  by  letting  A  =  iy  in  equation  (31)  and  integrating  on 
z3 ,  there  follows 

N (N  - 1)  p(zx )  p(z2 )  c(z2  )N~2  U(zx-z2).  (32 

An  additional  integral  on  z2  yields 


N(N- 1)  p(zx)  jdz2  p(z2)c(z2)N-2  =  N p(zx)c(zx)N~'  =— c(z,); 


where  the  step  function  U(zx  -  z2)  takes  effect.  Finally,  integrating  on  z,  yields  1. 


THREE  LARGEST  RANDOM  VARIABLES 


The  probability  that  RV  x,„  is  the  largest  RV  and  lies  in  interval  (z, ,  z,  +  dzx ),  that  xk  is  the 
second-largest  RV  and  lies  in(z2,z2  +  dz2), and  thatx7  is  the  third-largest  RV  and  lies  in 
(z3,z3 +</z3),withz,  >z2  >z3,is 

N 

dz ,  dz 2  dz3  pm (z{)pk(z2)  Pj (z3 )  Yl c„ (z3 )  U(zl-z2)U(z2-z3)  (34) 

n= 1 

n#m,k,j 

for  m,k,j  all  unequal.  Then,  the  probability  thatx,,,  is  the  largest  RV,  thatxA  is  the  second-largest 
RV,  and  thatx7  is  the  third-largest  RV  is  obtained  by  integrating  equation  (34)  first  onz,to  obtain 

00  N 

Q(m,k,j)=  jdz3Pj(z3)  fj  {c„(z3)}  fdz2  pk(z2)ejz2).  (35) 

"='  z3 

At  this  point,  in  general,  the  remaining  double  integral  cannot  be  reduced  any  further,  although 
the  sum  of  all  the  {Q(m,  k,  j)}  over  all  unequal  m,  k,  j  must  be  unity.  However,  for  the  example  of 
exponential  RVs  in  equation  (5),  the  z2  integral  can  be  carried  out  to  yield 

ak  o  °°f  m 

Q{m,k,j)  =  — -L-  jdz  exp[-(am  +ak  +ay)z]  [3{l-exp(-a„  z)}.  (36) 

Qm  +  °k  0  w=l 

n*m,k,j 

For  given  numerical  values  of{oH},  this  single  integral  can  be  easily  evaluated  for  any  m,k,j  of 
interest. 

For  general  statistics  of  RVs{xn  },and  by  a  similar  procedure  to  that  presented  in  equations 
(20)  through  (24),  the  combined  probability  and  joint  PDF  of  z,  (=  x,„),  z2  (=  xk),  z3  (=  x7),and 
z4  (the  sum  of  the  remaining  RVs)  is 


Q 4  (w,  k,  j,  Zj ,  z2 ,  z3 ,  z4 )  =  t/(z, -z2)U(z2-z3)  pm  (z, )  pk  (z2 )  Pj  (z3 ) 

1  r  N 

X/2^r  j^exP(_^z4)  WC,XZ3^)  for  m, k,  j  all  unequal. 

1  ('  n=\ 

n*m,k,j 


(37) 


Function is  zero  for z4  >(N-3)z3  because  RVz4  <(N- 3)  z3  is  always  true.  Equation  (37) 
can  be  evaluated  numerically  with  moderate  computational  effort. 

The  sum  of  equation  (37)  over  all  unequal  m,k,j  is  the  unconditional  joint  PDF  of  RVs 
z , ,  z  2 ,  z3 ,  and  z  4  and  can  be  expressed  as 
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\ 


q4  (z, ,  z2 ,  z3 ,  z4 )  =  !/■ (z,  -  z2 )  U(z2  -  z3 )  \dA  exp (-A  z4 )  P(z3  ,A)T3, 


where 

t -t(7  z  z  r>  =  Y  y  PjShl  y  (3 

i3  —  T (Zj  9Z2?^3?/l)  ^  /_  ^  ~  /_  ^ 

m=l  Cm(Z3’^)  k=\  Ck(Z3>*')  M  Cj\Z3**') 
k*m  j^mjc 

By  adding  and  subtracting  the  missing  terms  in  each  sum,  starting  with  the  innermost  sum  and 
expanding  out  the  resulting  expressions,  the  following  form  is  obtained: 

T3=SX  S2  S3-Sx  S6-S2  S5-S3  S4+2S7,  (4 


where 


S{  —  S2  -^b„,  S3  -^c„,  S4  ^ja„b„, 

n  n  n  rt 

$5  =^janCn’  S6  ='^JbnCn,  S1-'Zamb.CB. 

n  n  n 


(41) 


and 


PjJi)  b  _  PShl  c 
(z3,a)’  -  c^z^y  - 


pM 

ck(z3>A) 


for  n  =  1 :  N. 


(42) 


Whereas  direct  evaluation  of  equation  (39)  would  require  a  triple-nested  sum,  requiring  a  number 
of  operations  of  the  order  of  N3 ,  equation  (40)  requires  only  the  seven  one-dimensional  sums  in 
equation  (41),  each  of  size  N. 

The  PDF  q4  (z, , z2 , z3 ,  z4 )  in  equation  (38)  is  zero  for  z4  >  (N  -  3)  z3  because  RV 
z4  <(N-  3)z3  is  always  true. 


9 


FOUR  LARGEST  RANDOM  VARIABLES 


The  probability  that  RV  xm  is  the  largest  RV  and  lies  in  interval  (z, ,  z,  +  dz} ),  that  xk  is  the 
second-largest  RV  and  lies  in(z2,z2  +  Jz2),thatxy  is  the  third-largest  RV  and  lies  in 
(z3 ,  z3  +  ^/z3 ),  and  that  x/  is  the  fourth-largest  RV  and  lies  in  (z4 ,  z4  +  dz4 ),  where 
z,  >z2  >z3  >  z4,is 

N 

dz .  dzi  dzi  dz 4  P,„  ( Z , )  p*  (z2 )  p,  (z3 )  p,  (z4 )  [7  c„  (z4 )  t/(z,  -  z2 )  t/(z2  -  z3 )  U (z3  -  z4 )  (43) 

>7=1 

form,k,j,ia\\  unequal.  Then,  the  probability  that  xm  is  the  largest  RV,  thatx*  is  the  second- 
largest  RV,  thatxy  is  the  third-largest  RV,  and  thatx,  is  the  fourth-largest  RV  is  obtained  by 
integrating  equation  (43)  first  onz,  to  obtain 

00  J\f  00  CO 

Q(m,k,j,i)  =  fdz4  p,(z4)  f]  (c„(z4)}  fdz3  Pj(z3)  fdz2  pk(z2)  em(z2).  (44) 

-x  z4  z. 

At  this  point,  in  general,  the  remaining  triple  integral  cannot  be  reduced  any  further,  although  the 
sum  of  all  the  {Q(m,  k,  j,  /)}  over  all  unequal  m,  k,  j,  i  must  be  unity.  However,  for  the  example  of 
exponential  RVs  in  equation  (5),  the  z2  and  z3  integrals  can  be  carried  out  to  yield 


Q(m,k,j,i)  = 


ak  cij  a, 


+ak)(am  +ak  +«7) 

*  N 

x  j  dz  exp  [~(am+ak  +ay  +a,)z]  ]_[{l-exp(-a,)  z)}. 


For  given  numerical  values  of{a()},this  single  integral  can  be  easily  evaluated  for  any  m,k,j,i  of 
interest. 


For  general  statistics  of  RVs{x„},and  using  a  procedure  similar  to  that  presented  in 
equations  (20)  through  (24),  the  combined  probability  and  joint  PDF  of 
zi  (=  x/«)’  z2  (=  x*)>  Z3  (=  xy-)»  Z4  (=  */),  andz5  (the  sum  of  the  remaining  RVs)  is 
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Function is  zero  for  z5  >  (N  -  4)  z4  because  RV z5  <(N-  4)  z4  is  always  true.  Equation  (46) 
can  be  evaluated  numerically  with  moderate  computational  effort. 

The  sum  of  equation  (46)  over  all  unequal  m,  k,  j,i  is  the  unconditional  joint  PDF  of  RVs 
z,,z2,z3,z4,andz5  and  is  given  by 

q5(zi,...,z5)  =  U(zl  -  z2 )  U(z2  -  z3 )  U(z3  -  z4 )  \dX  exp(-^ z5 )  P(zA ,X)TA,  (47) 

where,  after  expansion  and  manipulation,  the  fourth-order  sum  for  TA  can  be  expressed  as 

Ta  =  T(zx,z2,z3,za,A)  =  Sx  S2  S3  Sa+2(Sx  SlA+S2  Su+S3  Sn+SA  Sn) 

+  S5  Sxo  +S6  S9  +  S7  Ss  —  Sx  (S7  Sxo  +  S3S9  +  SA  Ss)  —  S2  (S3  S7  +SA  S6)  (48) 

-S2SAS5-6Sl5, 

and  the  15  sums  are  given  by 

61]  S2  =  yybn,  S3  =  ^cn,  SA  =  ^d„, 

n  n  n  n 

$5  $6  ~'^LianCn'>  ^7  ~~^j^nCn^ 

n  n  n  n  (49) 

£9  =^jbndni  ^11  =  Cn>  Sn  —  ^J,an 

n  n  n  n 

$13  =  ^jan  Cn  d„>  cn  dn,  SX5=^a„bncndn, 


a  =  PjS^L  b  =  Pn^  ,  c  d  =  P,'{Za)-  foxn  =  \\N.  (51 

”  c„(za,A)’  "  c„(za,A)  ”  cn(z4,A)  "  c„(z4,A ) 

Again,  the  simplification  afforded  by  form  (48)  is  a  considerable  improvement  on  the  initial 
quadruple  nested  sum  encountered  for  TA  in  equation  (47).  All  1 5  sums  in  equation  (49)  are  of 
size  N.  An  additional  shortcut  is  available  by  defining  en  -  an  bn  and/„  =  cn  dn  for  n  =  1  :N  in 
all  the  sums  in  equation  (49). 

The  PDF  q5  (z, ,  z2 ,  z3 ,  z4 ,  z5 )  in  equation  (47)  is  zero  for  z5  >(N-  4)  z4  because  RV 
z5  <  (N  -4)z4  is  always  true. 
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M- 1  LARGEST  RANDOM  VARIABLES 

Let  =  zi  be  the  largest  RV,  xnij  =  z2  be  the  second-largest  RV,  and  xfflu  (  =  zA/_,  be  the 
(Af-l)-th-largest  RV.  Also,  let  zA/ be  the  sum  of  the  remaining  RVs.  Then, 

(Ar  +  l-M)zA/_,  is  always  true. 

Observation  of  equation  (46)  reveals  that  the  combined  probability  and  joint  PDF  of  the  M 
RVs  {zm},  m  =  1:A/,  is  given  by 

M -2  AY-1 

^A/(w,,...,Wa/_|;Zi,...,Za/)  =  ]”J{£/(zm  ~  Zm+|)}  J W\{Pm)(,Zj)} 

m= 1  7=1 

X  /2^  exp(— ^  ZA/  )  -1 » ^  )/ ]~ [  {Cm7  (^A/  1 '  } 

c  /  7=1 

For  this  quantity  to  be  nonzero,  it  is  required  that 
z,  >z2  >->zAM  and  zu  <  (JV  +  1 -M)  zMA. 


(51) 


(52) 


RECURSION  FOR  m  -TH  ORDER  SUM 

As  A/-1,  the  number  of  the  largest  RVs  of  interest,  increases,  the  initial  form  for  the  nested 
sum Tm  =  F(z,,...,zm,/l),  m  =  M-l,  becomes  impractical  computationally.  Also,  the  expansion 
and  simplification  procedure  leading  to  compact  equations  (40)  and  (48)  for  m  =  3  and  m  =  4, 
respectively,  becomes  very  tedious  and  unwieldy.  A  method  around  these  limitations  is  to 
develop  a  recursion  procedure  for  getting  Tm  directly  from  Tm  , . 

The  essentials  of  this  derivation  begin  with  the  definition 

A'  N  N  N  N 

A2(a,b)  s  £a„  ^bm  =  =  sum(d)*  sum(b)  -  sum(a*  b).  (53) 

n- 1  m=1  n=l  ffj-!  w=l 

m*n 


Suppose  a  program  is  written  to  perform  this  task  on  sequences  {a„ }  and  {bn } .  Now,  consider  the 
third-order  nested  sum 

A3  (a,b,c)  =  X  a„  £  bm  £  ck .  (54) 

n=\  m-\  k= 1 

m*n  k*m,n 
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Develop  the  inner  sum  according  to 

N  N  f  N  ^ 

A3  (a,b,c)  —  cin  ^  1  bm  c k  Cm 

n= 1  m= 1  \k- 1  ) 

m*n 

=  sum{c )  *A2(a,b)-  A2  (a.  *c,b )  -  (a,b.*c). 


(55) 


where  the  notation  introduced  in  equation  (53)  has  been  used.  Thus,  A3  can  be  evaluated  by  three 
calls  to  function  A2 .  It  follows  in  a  similar  fashion  that 


N  N  N  N 

A^  b,  c,  d )  =  ) '  Gn  ^ '  bm  ^ '  q  ^  1 

n= 1  m=l  t=l  7=1  (56) 

m*n  k*m,n  j*k,m,n  v  7 

=  sum(d )  *  (a,  b,c)~  A3(a.*  d,b,c)~  A3(a,b.*  d,c)~  A3(a,b,c*  d). 


That  is,  in  general,  Am  can  be  evaluated  by  m  calls  to  Am_x ,  with  appropriate  combinations  of 
arguments. 

It  should  be  observed  that  the  use  of  the  recursive  approach  is  not  as  economical  as  having 
explicit  expressions  for  the  high-order  sums  of  interest.  For  example,  the  use  of  equation  (55) 
employs  10  summations,  whereas  the  direct  use  of  equation  (40)  requires  that  only  seven  sums 
be  evaluated.  (Sums  SX,S2,  and  S7  are  evaluated  twice  in  equation  (55).)  This  loss  of  economy  is 

present  at  every  level  and  gets  worse  as  m  increases.  Also,  the  amount  of  computational  effort 
increases  noticeably  with  m.  In  fact,  the  number  of  summations  that  must  be  evaluated  at  level  m 
is  2m  -1;  observe  the  results  in  equations  (30),  (41),  and  (49),  for  example.  This  additional 
effort  serves  to  effectively  limit  the  level  to  which  the  procedure  can  be  carried  out;  that  is, 
evaluation  of  the  joint  PDF  of  the  m  largest  RVs  and  the  sum  of  the  remainder  is  not 
computationally  feasible  for  very  large  m.  Additional  effort  on  extending  results  like  equations 
(48)  through  (50)  would  probably  be  very  worthwhile,  at  least  for  m  =  5  or  6.  For  example,  at 
m  =  5,  the  number  of  different  sums  to  be  evaluated  is  25  - 1  =  3 1,  and  the  number  of  different 
types  of  terms  in  T5  is  52.  Namely, 

T5  =  24  Q(5)  -  6  0(4,1)  -  2  0(3,2)  +  2  0(3,1, 1)  +  0(2, 2,1)  -  0(2, 1,1,1)  +  0(1,1, 1,1,1),  (57) 

where 
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Q(5)  =  ^abcde,  lterm, 

0(4,1)  =  ^abcd  +  5  terms, 

Q(3,2)  =  ^abc^de  +  ---,  lOterms, 

QQ,\,\)  =  'Yjabc'YJd'Yje  +  ---,  lOterms,  (58) 

0(2, 2,1)  =  ^ab  ^cd  ]Te  +  -",  15  terms, 

0(2, 1,1,1)  =  ^ ab  ^d  +  lOterms, 

0(1, 1,1, 1,1)  =  ^a  ^b  1  term. 


SECOND-LARGEST  RANDOM  VARIABLE 

The  probability  thatx*  is  the  largest  RV,  thatx;  is  the  second-largest  RV,  and  that 
Xj  e(z,,2,  +dzx) is 

N 

dzx  Pj  (zi )  **  (z, )  n  c»  )> k*  j- 


Given  that  x,  =  z, ,  the  conditional  PDF  ofx*  is^^  U(x  -  z, ),  while  that  forx„,  n  *  j,k,  i 

ek(z\) 

~r  ~  ^ (z,  -  *)•  The  corresponding  conditional  MGF  ofx.  is 

c»(zi) 

frit  £t^H.  exp(A  x)  m 
zf  ek(Z\ )  «*Ul) 

and  that  ofx.,  is 


}&  exp(^x)  =  „  *  JX 

-t  C,XZ\ )  c„  (z, ) 
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JY 

The  conditional  MFG  ofz2  =  xk  +  ^x„  is 


(zp^-)  A c»(z lvjO 
g*Oi)  L  c„(^i) 

n*j,k 

The  conditional  PDF  ofz2at  argument  z2,  given  z,  =  x  has  value  z,,  is 


a*  > 


ek(Zx)  «=i  g«(zi) 

n*j,k 


Finally,  the  product  of  equations  (59)  and  (63)  and  <iz2  is 
1  N 

dzx  dz2  pj  (Zj )  —  f 'dX  exp(-A  z2 )  ek  (z, ,  X)  J}  c„  (zx  ,X),  k*  j, 
i2n  •  „= i 

which  is  the  probability  thatx*  is  the  largest  RV,  thatxy  is  the  second-largest  RV,  that 
Zj  =  Xj  e(z,,z,  +  dzx),  and  that z2  e(z2,z2  +  dz2).  That  is, 

1  N 

q2(j,k,zl,z2)  =  Pj (z, )  —  exp(-T z2)  ek(zx,X)\\cn{zx, X),  *  *  y. 


is  the  combined  probability  (thatx*  is  the  largest  RV  and  thatxy  is  the  second-largest  RV)  and 
joint  PDF  ofzpthe  second-largest  RV,  andz2,the  sum  of  the  remaining  RVs. 

The  sum  of  equation  (65)  over  k, 

j  N  N 

^2(y,z,5z2)  =  Pj(zx)  [dX  exp(-Az2)  ^ek(zx,A)  f  Jcm(zltX),  0 


is  the  combined  probability  (thatxy  is  the  second-largest  RV)  and  joint  PDF  ofz,  (=  xy )  andz2 
(the  sum  of  the  remaining  RVs).  This  quantity  can  be  written  as 

^20*>Zi,z2)  =  Pj(z\)~~  fa  exp(-X  z2 )  S  T77~jT  forJ=l:N-  (6 

i2n  •  cAzx,A)  *=1  ck(Zj,A) 
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Then,  the  sum  of  equation  (67)  over  j, 


lM  expMZi)  P(2-A) 

c  j= 1  Cj(z  1>^)  *=1  C*(Z1’^) 

k*j 


is  the  overall  joint  PDF  of  the  second-largest  RVz,  and  the  sum  of  the  remaining  RVs  z2.  This 
last  quantity  can  be  expressed  alternatively  as 


q 2  =  ^  \dX  eXP(~X  Z2  )  P(Z\ >  *)  (5,  52  -  Si  ), 


where  one-dimensional  sums 


£  _  y  Pn  )  g  _  y  g»  (^1  >  ‘l^')  ^  _  V1  Pn  (z\ )  g»  iZ]  i  jO 

n=l  Cn(Z|!^)  n=!  Cn(Z|>-^)  «=]  Cn(zl5A)2 


SUMMARY 

The  joint  statistics  of  M-l  ordered  random  variables  and  the  sum  of  the  remaining  random 
variables  have  been  derived  for  several  values  of  low-order  M.  The  original  random  variables, 
prior  to  ordering,  are  independent  and  can  have  arbitrary,  different  probability  density  functions. 
Results  for  the  joint  probability  density  function  of  the  M  random  variables  of  interest,  as  well  as 
a  combined  probability  and  joint  probability  density  function,  have  been  derived  in  the  form  of  a 
single  contour  integral  in  the  moment-generating  domain.  Numerical  evaluation  of  this  contour 
integral  is  most  easily  accomplished  by  approximately  locating  the  real  saddlepoint  of  the 
integrand  and  moving  the  Bromwich  contour  so  as  to  pass  through  this  point.  However,  instead 
of  resorting  to  a  saddlepoint  approximation,  high  accuracy  in  the  evaluation  of  the  joint 
probability  density  function  is  achievable  by  numerical  integration  along  this  displaced  contour. 

A  recursive  procedure  has  been  developed  for  evaluating  a  nested  sum  that  occurs  in  the 
evaluation  of  the  joint  probability  density  function.  For  values  of  M  in  the  range  of  6  to  10,  this 
is  a  very  helpful  numerical  aid.  For  much  larger  values  of  M,  execution  time  increases  very 
rapidly  and  becomes  a  significant  limitation. 


REFERENCES 


1 .  A.  H.  Nuttall,  “Joint  Probability  Density  Function  of  Selected  Order  Statistics  and  the  Sum 
of  the  Remaining  Random  Variables,”  NUWC-NPT  Technical  Report  1 1,345,  Naval 
Undersea  Warfare  Center  Division,  Newport,  RI,  15  January  2002. 


16 


INITIAL  DISTRIBUTION  LIST 


Addressee  No.  of  Copies 

Office  of  Naval  Research  (D.  Abraham,  J.  Tague,  B.  Fitch)  3 

University  of  Connecticut  (P.  Willett)  1 

University  of  Rhode  Island  (S.  Kay)  1 

Defense  Technical  Information  Center  2 


